268

17

Genomics

indeed the human gastrointestinal tract; cf. Chap. 19). It may sometimes be necessary

to clone the DNA (using cultured laboratory bacteria) in order to produce sufficient

material for further analysis.

The most basic analysis is simply to sequence all the DNA. This may result

in millions of genes, which can be compared with known sequences; early work

appeared to reveal an astonishing diversity of whole classes of hitherto unknown

genes. One should, however, be mindful of the influence of sequencing errors in

giving the appearance of more novelty than is actually the case. 30

Function-based metagenomics obviates the need to sequence the DNA by letting

the fragments be translated, again in laboratory-cultured bacteria. Novel proteins of

phenotypes are then further analysed.

Metagenomics can be used to better understand the diversity and functions of

microbial communities in a wide range of habitats, and to gain insights into the

evolutionary processes that shape the communities. It can also be used to detect and

control potentially disease-causing organisms, and explore the potential of microbial

communities for producing substances useful to mankind.

References

Allison L, Powell D, Dix TI (1999) Compression and approximate matching. Comput J 42:1–10

Bibikova M, Fan JB (2010) Genome-wide DNA methylation profiling. WIREs Syst Biol Med

2:210–223

Bibikova M et al (2006) High-throughput DNA methylation profiling using universal bead arrays.

Genome Res 16:383–393

Braslavsky I, Hebert B, Kartalov E, Quake SR (2003) Sequence information can be obtained from

single DNA molecules. Proc Natl Acad Sci USA 100:3960–3964

Brinkman AB, Simmer F, Ma K, Kaan A, Zhu J, Stunnenberg HG (2010) Whole-genome DNA

methylation profiling using MethylCap-seq. Methods 52:232–236

Committee on Metagenomics: Challenges and Functional Applications (2007) The new science of

metagenomics. National Academies Press, Washington DC

França LTC, Carrilho E, Kist TB (2002) A review of DNA sequencing techniques. Q Rev Biophys

35:169–200

Gorban AN, Popova TG, Sadovsky MG (2000) Classification of symbol sequences over their fre-

quency dictionaries. Open Syst Inform Dynam 7:1–17

Gorban AN, Popova TG, Zinovyev A (2005) Codon usage trajectories and 7-cluster structure of

143 complete bacterial genomic sequences. Physica A 353:365–387

Heyn H, Esteller M (2012) DNA methylation profiling in the clinic: applications and challenges.

Nat Rev Genet 13:679–692

IHGSC (International Human Genome Sequencing Consortium) (2001) Initial sequencing and

analysis of the human genome. Nature 409:860–921

Karlin S, Ladunga I, Blaisdell BE (1994) Heterogeneity of genomes: measures and values. Proc

Natl Acad Sci USA 91:12837–12841

Kruskal JB (1964) Multidimensional scaling by optimizing goodness of fit to a nonmetric hypoth-

esis. Psychometrika 29:1–27

30 Quince et al. (2009).